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receives (1) the received speech signal, (2) a signal indicating an estimate of the far-end 
noise level, and (3) a signal indicating an estimate of the near-end or ambient noise level. 
The dynamic range compression module performs a transform of the received speech 
according to the transform pattern shown in Fig. 2. 

Several drawback in the prior art systems exist which poorly address the issue of 
rapid variations in speech volume and very low speech levels. 
BRIEF DE SCRIPTION OF THE DRAWINGS 

The subject matter regarded as the invention is particularly pointed out and distinctly 
claimed in the concluding portion of the specification. The invention, however, both as to 
organization and method of operation, together with objects, features, and advantages 
thereof, may best be understood by reference to the following detailed description when read 
with the accompanying drawings in which: 

FIG. ] is a block diagram showing a speech intelligibility enhancement system 
according to the prior art; 

FIG. 2 is a graph showing the transform function or pattern of the system in Fig. 1 ; 
FIG. 3 is a block diagram showing a speech intelligibility enhancement system 
according to the present invention; and 

FIG 4 is a flow diagram showing the steps performed by a speech intelligibility 
enhancement system according to the present invention. 

It will be appreciated that for simplicity and clarity of illustration, elements shown in 
the figures have not necessarily been drawn to scale. For example, the dimensions of some 
of the elements may be exaggerated relative to other elements for clarity. Further, where 
considered appropriate, reference numerals may be repeated among the figures to indicate 
corresponding or analogous elements. 
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DETAILE D DF.SfTR TPTTDM 

In the following detailed description, numerous specific details are set forth in order 
to provide a thorough understanding of the invention. However, it will be understood by 
those .killed in the art that the present invention may be practiced without these specific 
details. In other instances, well-known methods, procedures, components and circuits have 
not been described in detail so as not to obscure the present invention. 

Unless specifically stated otherwise, as apparent from the following discussions, it is 
appreciated that throughout the specification discussions utilizing terms such as 
"processing", "computing", "calculating", "determining", or the like, refer to the action 
and/or processes of a computer or computing system, or similar electronic computing device, 
that manipulate and/or transform data represented as physical, such as electronic, quantities 
within the computing system's registers and/or memories into other data similarly 
represented as physical quantities within the computing system's memories, registers or other 
such information storage, transmission or display devices. 

Embodiments of the present invention may include apparatuses for performing the 
operations herein. This apparatus may be specially constructed for the desired purposes, or 
it may comprise a general purpose computer selectively activated or reconfigured by a 
computer program stored in the computer. Such a computer program may be stored in a 
computer readable storage medium, such as, but is not limited to, any type of disk including 
floppy disks, optical disks, CD-ROMs, magnetic-optical disks, read-only memories (ROMs), 
random access memories (RAMs) electrically programmable read-only memories 
(EPROMs), electrically erasable and programmable read only memories (EEPROMs), 
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magnetic or optical cards, or any other type of media suitable for storing electronic 
instructions, and capable of being coupled to a computer system bus. 

The processes and displays presented herein are not inherently related to any 
particular computer or other apparatus. Various general purpose systems may be used with 
programs in accordance with the teachings herein, or it may prove convenient to construct a 
more specialized apparatus to perform the desired method. The desired structure for a 
variety of these systems will appear from the description below. In addition, embodiments 
of the present invention arc not described with reference to any particular programming 
language. It will be appreciated that a variety of programming languages may be used to 
implement the teachings of the inventions as described herein. 

As part of the present invention, a received signal containing speech may be divided 
into frames having some predefined number of samples. A frame may then be analyzed to 
determine an instantaneous peak value, and from the instantaneous peak values of two or 
more consecutive frames, a long term peak value may be determined for each frame. For a 
igs given frame, an automatic gain control ("AGC") value may be calculated in relation to the 
difference between a desired volume or amplitude level and the frame's long term peak 
value. A noise factor between 0 and 1 may be factored into the AGC value. The samples 
within a frame may be increased or adjusted by the AGC value. 

A dynamic range compression ("DRC") gain value may be calculated for each frame 
and the samples within the frame may be adjusted by the RDC gain value. A noise tactor 
may also me factored into the DRC gain value. The DRC gain value for a frame may be 
calculated as the difference between the frame's long term peak value and its instantaneous 
peak value, multiplied by a maximum gain value divided by A L, where A L is a range value. 
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A target gain value for a frame, which is (he sum of the frame's AGC arid RDC 
gains, may be smoothed from frame to subsequent frame. One method of smoothing is to 
average a frame's calculated target gain with the previous frame's target gain value. 

l uring now to Fig. 3, there is shown a system according to the present invention. A 
received signal RX is decoded by decoder 10 and speech within the signal is detected by a 
voice activity detector 20. A noise suppressor 30 may remove noise from the signal and pass 
the signal to the AGC unit 40. The AGC unit 40 may compute an AGC gain value and apply 
the value to the signal, and the DRC unit 50 may also compute and apply a DRG gain value 
to the signal. Both the AGC 40 and the DRC 50 may receive a signal from a TX noise 
estimator 60 indicating an estimate of the noise on the receiver or near side. 

Fig. 4 is a flow diagram showing the steps performed by a speech intelligibility 
enhancement system according to the present invention. For each frame of a received signal 
an instantaneous peak amplitude or value is computed as part of step 100. The size of a 
frame may vary (e.g. 0. 1 to 100 ms). The instantaneous peak amplitude is simple the highest 
absolute value of any sample in the frame in dB: 

Qnstantaneous peak amplitude - 20*log(max(abs(x[n])))jJ 
As part of step 200, a long term peak amplitude value is calculated for a frame having speech 
present therein. Step 300 shows that speech presence in a frame may be indicated by a Hag 
determined either by a voice activity detection algorithm for a fixed-rate vocoder, or by the 
vocoder rate being maximal for a variable rate vocoder. A frame's long term peak amplitude 
value may be determined using the following logic: 

-""^ If(voice present) 

If(instantaneous peak value > long term peak amplitude value) 

long term peak amplitude value = instantaneous peak value 
Else if(instantaneous peak valuc>Minimum Acceptable Value) 

long term peak amplitude value = long term peak amplitude value - 
decrement peak value 1 
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The decrement peak value is chosen to decrease the long tcnn peak amplitude value 
by a nominal amount. 

As part of step 400 an AGC gain value is calculated. The AGC gain may be 
calculated by Subtracting from a target or desired level value LI the frame's long term peak 
amplitude value, m one embodiment of the present invention the calculated AGC gain may 
also be multiplied by a TX noise factor calculated as pari of steps 500, 600 and 700. 

A DRC gain value is computed as part of step 800. The DRC gain may be computed 
as the difference between a frame's long term peak value and its instantaneous peak value, 
multiplied by a maximum gain value divided by AL: 

r -7 & 

VDRC ga.nj^.ong^^^ DRC Gain/A L, ^ 

Where A L is a range value typically 20dB and Max RDC gain is also typically 20dB. The 
□RC gain value may also be multiplied by a noise factor such as the one calculated in steps 
500, 600 and 700. 

A target gain value may be calculated in step 900. The target gain value may be the 
sum of the AGC and RXXT gain values. Smoothing of the computed target gain is performed 
in step 1 000. Each sample in each frame may be adjusted by the smoothed target gain value. 

While certain features of the invention have been illustrated and described herein, 
many modifications, substitutions, changes, and equivalents will now occur to those skilled 
in the art. It is, therefore, to be understood that the appended claims are intended to cover all 
such modifications and changes as fall within the txue spirit of the invention. A description 
of a second embodiment is attached as Appendix A. 
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AVC pdchann2.5 algorithm doc a a 

ai^umnm.aoc Adoram Lreli 5/12/00 

RX-AVC: Automatic Volume Control Algorithm 

(Internal distribution only) 

1 Background 

Zlc^l and 7 adVan ° ed fU " Cti0n that automali ^'y -ntrols the rece.ved 

speech level and dynamic range. The AVC is a collection of 3 functions The 3 function, 
can be acUvated both separately and together. The 3 functions are: 

1 Automatic gain control (AGC) 

2 TX noise lever" ^''^ *** - m P ress '°" (DRC) as a function of 
3. Dynamic range compression (DRC) for speakerphone 

^"^z Si^r how ,he rx - avc ** ^ «» °^ 
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Figure 1: Audio path with RX-AVC. 
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2 Algorithm description 

The total AVC gain is the sum of 3 gains- 
1. AGC 

"ijng nags (AGC_FLAG and NOISE rpcphmcc ct a „ 
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Figure 2: block diagram 
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2. 1 Blocks description 

2.1 .1 short-term peak & short-term nns level 

Computes maximal sample energy over a sub frame several samples (default- 1 6) 
Computes rms energy over the sub frame 

2.1.2 Compute signal envelope level & speech-RMS 

Envelope is updated if RX-VAD detected speech and if the local peak is above a 
threshold; r 

if (RX-VAD &,& local_peak > MiN__ENVELOPE) 
{ 

if (iocal^peak > envelope) 

envelope = rniTi(local_peak, envelope + INCREMENT_PEAK) ; 
else 

envelope -= 1) EC REM ENT_PEA K ; 

if (rms > Jong_term_rms) 

long_term M rms INCREMENT^RMS ; 
else 

long^tem^rms -= 1NCREMENT_RMS ; 

) 

2.1.3 Compute ACC gain 

Basic relation: agc_gain -- (LEVEL 1 - envelope) ; 

Limitation 1 : agc_gain = min(MAX_AGC_GAIN, agc_gain) ; 

Limitation2: agc_gain = min(MAX__RMS . lon^Lerm_rms, agc_gain) ; 

2.1.4 Compute DRC gain 

Given the following noise dependent andRX- signal independent parameter: 
drc_gain - MAX_DRC_GA1N * max(drcjain_factor. noise_factor) ; 

Compute for each sub-frame the following gam (frame.drcjjain) as a fiinction of it, local 
peak and the envelope level as depicted in the following figure: 
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Figure 3: DRC gain function 

2.1 .5 Estimate TX noise level 

Sec Period_alg.doc 

2.1 .6 Compute Noise factor 

Noise factor is a function of noise level: 




Figure 4: Noise factor 
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2.1.7 Gain smoothing 

Signed exponential smooching: 

if (target_gain > smoolh_gain_db) 
alfa = ALFA UP; 
else 

alfa = ALFA_DOWN ; 



smooth_gain_db = smooth_^ain_db * (1-alfa) + alfa * target__gaiii 
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3 Parameters 



3. 1 Tunable parameters 
AGCFLAG 

AGC enable/disable control bit 
NOISE_RESPONSE FLAG 

^XTcSr for the amb,em noise - induced — — 

Controls the amount of DRC, ranging from 0 to 0*7FFF. Defaul = 0 • 
LEVEL1 

cTp^vir * e ***** C " Ve ' 0Pe Detalt "** "3 «*.. (3 d B Wow 

HIGH/LOW_AMBlENT_NOISE 

See figure 3. The default values n-e io ^n^ /■ 
^*««ho.* wiling 

modified if the TX analog P ach gam ,s ch^pt e » 17^^°!^ ^ * 
« mcreased by X dB, these thresholds should be iZ^^^^ ^ 
MAX_AGC_GAIN 

iSK.'iSir. m m ° dU,e " C °™°C: g >,„- (see 2 ,.3>. T h e 

MAX_DRC_GA IN 

decrease the effect of the DRC. Decreasing this parameter will 

MAX_RMS 

The AGC gain is limited so thai the sienal R\4«3 

MAX_RMS (default = -| 5 dBov) amphf.cation does not exceed 

MiN_ENVELOPE 

Signals whose envelope )S smal.er than thts threshold do not affect the AGC. 
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3.2 Soft constants (default) 



N UM_S U B_FR AMES 
SUB_FRAM_LEN 
LOG_SlJB_FRAM_LEN 
D LEVEL 12 (see Figure 3) 
DLEVEL23 (see Figure 3) 
DECREMENT_PEAK 
INCREMENT_PEAK 
INCREMENT_KMS 
ALFA_UP (see 2.1 J) 
ALFA.DOWN (see 2.1.7) 



(10) 

(FRAM_LEN/NUM_SUB_FRAMES = 16) 
(!O*logl0(SUBJFRAM_LEN)= 12) 
(24) 

(26) 

(3*SUB_FRAM_LEN/8000 = 3 dB/sec) 
(1 = 1 dB/sub_frame) 

(10*SUB_FRAM_LEN/8000 = 10 d!3/sec) 

0.1) 

(0.5) 



7 



